Why selecting Pet Adoption?

According to the American Society for the Prevention of Cruelty to Animals (ASPCA), around 4 million dogs enter shelters every year (ASPCA, n.d.). Many companion animal seekers acquire their new family members from shelters (Weiss, Miller, Mohan-Gibbons, & Vela, 2012). Meanwhile, Pet Food Manufacturers Association (2012) reported that almost 47%... Read More

Problem To Be Solved

In this article, I will evaluate the animal adoption speed from the Kaggle website. The effects of the physical characteristics of age, gender, breed, color, fur length, maturity size, health condition, whether the pets are being vaccinated, dewormed or sterilized and using machine learning tools to analyze associations between description... Read More

Data

overview and cleaning

PetFinder.my Adoption Prediction is acquired from Kaggle.com (PetFinder.my, 2019). The original row datasets contain 25 features and 14,993 samples in train.csv and 24 features and 3,972 samples in test.csv. Through the data cleaning process, the cleaned datasets contain 21 features and 14,993 samples in train.csv and 20 features and 3,972... Read More

Visualization

correlation

correlation among variables Visualization on variables provides multi-dimensional information to obtain some basic ideas of the relationship between categorical variables and ‘AdoptionSpeed’ variable. The correlational plot on all variables is created to assess the positive or negative relations among variables. Figure1... Read More

Machine Learning Application

Multinomial Naive Bayes & Random Forest

The two models on NLP chosen for ‘Description’ variable and ‘AdoptionSpeed’ variable are Multinomial Naïve Bayes Classifier and ensemble method classification. On the first try, Multinomial Naïve Bayes classifier is applied since it provides a nice baseline for several variants of a classifier, the multinomial variant will be... Read More